FAILURE OF THE $L^1$ POINTWISE AND MAXIMAL ERGODIC
THEOREMS FOR THE FREE GROUP

TERENCE TAO


Abstract. Let $F_2$ denote the free group on two generators $a, b$. For any measure-preserving system $(X, \mathcal{X}, \mu, (T_g)_{g \in F_2})$ on a finite measure space $X = (X, \mathcal{X}, \mu)$, any $f \in L^1(X)$, and any $n \geq 1$, define the averaging operators

$$ A_n f(x) := \frac{1}{4 \times 3^{n-1}} \sum_{g \in F_2: |g| = n} f(T_g^{-1} x), $$

where $|g|$ denotes the word length of $g$. We give an example of a measure-preserving system $X$ and an $f \in L^1(X)$ such that the sequence $A_n f(x)$ is unbounded in $n$ for almost every $x$, thus showing that the pointwise and maximal ergodic theorems do not hold in $L^1$ for actions of $F_2$. This is despite the results of Nevo-Stein and Bufetov, who establish pointwise and maximal ergodic theorems in $L^p$ for $p > 1$ and for $L \log L$ respectively, as well as an estimate of Naor and the author establishing a weak-type $(1,1)$ maximal inequality for the action on $\ell^1(F_2)$. Our construction is a variant of a counterexample of Ornstein concerning iterates of a Markov operator.


1. Introduction 

Let $F_2$ denote the free non-abelian group on two generators $a, b$. Define a reduced word to be a word with letters in the alphabet $\{a, b, a^{-1}, b^{-1}\}$ in which $a, a^{-1}$ and $b, b^{-1}$ are never adjacent, and for each $g \in F_2$, define the word length $|g|$ of $g$ to be the length of the unique reduced word that produces $g$. We let $F_2^2$ denote the index 2 subgroup of $F_2$ consisting of $g \in F_2$ with even word length.

Define an $F_2$-system to be a quadruple $(X, \mathcal{X}, \mu, (T_g)_{g \in F_2})$, where $(X, \mathcal{X}, \mu)$ is a measure space with $0 < \mu(X) < \infty$, and $T_g: X \to X$ is a family of measure-preserving maps on $X$ for $g \in F_2$, with $T_1$ the identity and $T_g T_h = T_{gh}$ for all $g, h \in F_2$; in particular, the $T_g$ are bi-measurable with $T_g^{-1} = T_{g^{-1}}$. One can of course normalise such systems to have total measure 1 by dividing $\mu$ by $\mu(X)$, but (as we will eventually be gluing several systems together) it will be convenient not to always insist on such a normalisation. As the free group $F_2$ has no relations, such a system can be prescribed by specifying two arbitrary invertible bi-measurable measure-preserving maps $T_a, T_b: X \to X$, and then defining $T_g$ for all other $g \in F_2$ in the obvious fashion.

We say that an $F_2$-system is $F_2$-ergodic if all $F_2$-invariant measurable sets either have zero measure or full measure, and $F_2^2$-ergodic if the same claim is true for $F_2^2$-invariant measurable sets. For any $f \in L^1(X) = L^1(X, \mathcal{X}, \mu)$ and any $n \geq 1$, we define the averaging operators

$$ A_n f(x) := \frac{1}{4 \times 3^{n-1}} \sum_{g \in F_2: |g| = n} f(T_g^{-1} x); $$

note that $4 \times 3^{n-1}$ is the number of reduced words of length $n$. One can of course use symmetry to replace $T_g^{-1}$ by $T_g$ if desired.
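The count $4 \times 3^{n-1}$ of reduced words of length $n$ can be confirmed by brute force for small $n$. The following is only an illustrative sketch; the encoding of $a^{-1}, b^{-1}$ as the upper-case letters `A`, `B` and the function names are assumptions made here, not notation from the paper.

```python
from itertools import product

# Alphabet {a, b, a^{-1}, b^{-1}}; upper case denotes an inverse (our convention).
LETTERS = "abAB"
INVERSE = {"a": "A", "A": "a", "b": "B", "B": "b"}


def reduced_words(n):
    """Return all reduced words of length n as strings."""
    words = []
    for candidate in product(LETTERS, repeat=n):
        # A word is reduced when no letter is followed by its own inverse.
        if all(INVERSE[x] != y for x, y in zip(candidate, candidate[1:])):
            words.append("".join(candidate))
    return words


def sphere_size(n):
    """Closed form 4 * 3^(n-1) for the number of reduced words of length n >= 1."""
    return 4 * 3 ** (n - 1)


# Brute-force enumeration agrees with the closed form for small n.
for n in range(1, 7):
    assert len(reduced_words(n)) == sphere_size(n)
```

The first letter can be chosen in 4 ways and each later letter in 3 ways (anything but the inverse of its predecessor), which is what the enumeration reproduces.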

The pointwise convergence of the operators $A_n$ was studied by Nevo and Stein [8] and Bufetov [3], who (among other things) proved the following result:

Theorem 1.1 (Pointwise ergodic theorem). Let $(X, \mathcal{X}, \mu, (T_g)_{g \in F_2})$ be an $F_2$-system. If $\int_X |f| \log(2 + |f|)\, d\mu < \infty$, then $A_{2n} f$ converges pointwise almost everywhere (and in $L^1(X)$ norm) to an $F_2^2$-invariant function. In particular, if $(X, \mathcal{X}, \mu, (T_g)_{g \in F_2})$ is $F_2^2$-ergodic, then $A_{2n} f$ converges pointwise almost everywhere and in $L^1(X)$ norm to the constant $\frac{1}{\mu(X)} \int_X f\, d\mu$.


The restriction to even averages $A_{2n}$, and the use of $F_2^2$ instead of $F_2$, can be seen to be necessary by considering the simple example in which $X$ is a two-element set $\{0, 1\}$ (with uniform measure) and $T_a, T_b$ interchange the two elements 0, 1 of this set. The original paper of Nevo and Stein [8] established this theorem for $f \in L^p(X)$ for some $p > 1$, by modifying the methods of Stein [12]. The subsequent paper of Bufetov [3] used instead the “Alternierende Verfahren” of Rota [11] to cover the $L \log L$ case. Both arguments also extend to several other group actions (see e.g. [4], [6], [9]), but for simplicity of exposition we shall focus only on the $F_2$ case. We also remark that both arguments also give bounds on the associated maximal operator $f \mapsto \sup_n A_{2n} |f|$. See also [1], [2] for an alternate approach to pointwise ergodic theorems in $L^p$ and $L \log L$.

In [8] the question was posed as to whether the above pointwise ergodic theorem extended to arbitrary $L^1(X)$ functions. The main result of this paper answers this question in the negative:

Theorem 1.2 (Counterexample). There exists an $F_2$-system $(X, \mathcal{X}, \mu, (T_g)_{g \in F_2})$ and an $f \in L^1(X)$ such that $\sup_n |A_{2n} f(x)| = \infty$ for almost every $x \in X$. In particular, $A_{2n} f(x)$ fails to converge to a limit as $n \to \infty$ for almost every $x \in X$.


As such, there is no pointwise ergodic theorem or maximal ergodic theorem in $L^1$ for actions of the free group $F_2$. Our construction also applies to free groups $F_r$ on $r$ generators for any $r \geq 2$; we leave the modification of the arguments below to this more general case to the interested reader. This result stands in contrast to the situation for the regular action of $F_2$ on $\ell^1(F_2)$, for which a weak-type $(1,1)$ bound for the maximal operator was established by Naor and the author [7, Theorem 1.5]. Note that the estimate for $\ell^1(F_2)$ does not transfer to arbitrary $F_2$-systems due to the non-amenability of the free group $F_2$.

Because the sphere $\{g \in F_2 : |g| = n\}$ is a positive fraction of the ball $\{g \in F_2 : |g| \leq n\}$, the above result also holds if the average over spheres is replaced with an average over balls, or with regards to other minor variations of the spherical averaging operator such as $\frac{1}{2} A_n + \frac{1}{2} A_{n+1}$. This negative result for averaging on balls stands in contrast with the situation for amenable groups, for which pointwise and maximal ergodic results in $L^1$ are established for suitable replacements of balls, such as tempered Følner sets; see [5]. On the other hand, if one considers the Cesàro means $\frac{1}{N} \sum_{n < N} A_n$ of spherical averages on $F_2$-systems, then pointwise and maximal ergodic theorems in $L^1$ were established in [8].
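The statement that the sphere of radius $n$ occupies a positive fraction of the ball of radius $n$ can be made quantitative with a short computation. The sketch below is illustrative only (the helper names are assumptions); it uses the count $4 \times 3^{k-1}$ of reduced words of length $k \geq 1$ together with the single word of length $0$.

```python
from fractions import Fraction


def sphere_size(n):
    """Number of elements of F_2 of word length exactly n."""
    return 1 if n == 0 else 4 * 3 ** (n - 1)


def ball_size(n):
    """Number of elements of F_2 of word length at most n."""
    return sum(sphere_size(k) for k in range(n + 1))


def sphere_fraction(n):
    """Exact proportion of the ball of radius n taken up by its boundary sphere."""
    return Fraction(sphere_size(n), ball_size(n))


# The ball has 2 * 3^n - 1 elements, so the proportion stays above 2/3.
for n in range(1, 30):
    assert ball_size(n) == 2 * 3 ** n - 1
    assert sphere_fraction(n) > Fraction(2, 3)
```

Since each sphere carries more than two thirds of the corresponding ball, an average of a non-negative function over the ball is at least two thirds of the average over its outermost sphere, which is why unboundedness passes from spherical averages to ball averages.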


Our construction is inspired by a well-known counterexample of Ornstein [10] demonstrating the failure of the maximal ergodic theorem in $L^1$ for iterates $P^n$ of a certain well-chosen self-adjoint Markov operator $P$. Roughly speaking, the function $f$ in Ornstein's example consists of many components $f_i$, each of which comes with a certain “time delay” that ensures that the dynamics of $P^n f_i$ only become significant after a significant period of time - in particular, long enough for the dynamics of other components of the function to have achieved “mixing” in the portion of $X$ where the most interesting portion of the dynamics of $P^n f_i$ takes place, allowing the amplitude of $f_i$ to be slightly smaller than would otherwise have been necessary to make $\sup_n P^n f$ large. To adapt this construction to the setting of $F_2$-systems, we need to glue together various $F_2$-systems that have the capability to produce such a “time delay”. We will be able to construct such systems by basically taking an “infinitely large ball” in $F_2$, gluing the boundary of that ball to itself, and redefining the shift maps on the boundary appropriately. Somewhat ironically, the positive results in Theorem 1.1 play a helpful supporting role in establishing the negative result in Theorem 1.2, by establishing the “mixing” referred to previously that is an essential part of Ornstein's construction.


1.1. Acknowledgments. The author is supported by NSF grant DMS-1266164 and 
by a Simons Investigator Award, and thanks Lewis Bowen for helpful discussions and 
corrections. 


2. Initial reductions 


We begin by reducing Theorem 1.2 to the following more quantitative statement.

Theorem 2.1 (Quantitative counterexample). Let $\alpha, \varepsilon > 0$. Then there exists an $F_2$-system $(X, \mathcal{X}, \mu, (T_g)_{g \in F_2})$ and a non-negative function $f \in L^\infty(X)$ such that

$$ \|f\|_{L^1(X)} \leq \alpha \mu(X) $$

but such that

$$ \sup_n A_{2n} f(x) \geq 1 - \varepsilon $$

for all $x \in X$ outside of a set of measure at most $\varepsilon \mu(X)$.


Let us see how Theorem 2.1 implies Theorem 1.2. By dividing $\mu$ by $\mu(X)$ we may normalise $\mu(X) = 1$ in Theorem 2.1. Applying the above theorem with $\alpha = \varepsilon = 2^{-m}$, we can thus find for each natural number $m$, an $F_2$-system $(X_m, \mathcal{X}_m, \mu_m, (T_{g,m})_{g \in F_2})$ with $\mu_m(X_m) = 1$, and a non-negative function $f_m \in L^\infty(X_m)$ such that

$$ \|f_m\|_{L^1(X_m)} \leq 2^{-m} $$

and

$$ \sup_n A_{2n} f_m(x) \geq 1 - 2^{-m} $$

outside of a set of measure at most $2^{-m}$.

Let $(X, \mathcal{X}, \mu, (T_g)_{g \in F_2})$ be the product system, thus $X$ is the Cartesian product $X := \prod_m X_m$ with product $\sigma$-algebra $\mathcal{X} := \prod_m \mathcal{X}_m$, product probability measure $\mu := \prod_m \mu_m$, and product action $T_g := \bigotimes_m T_{g,m}$. Each $f_m \in L^\infty(X_m)$ then lifts to a function $\tilde f_m \in L^\infty(X)$ with

$$ \|\tilde f_m\|_{L^1(X)} \leq 2^{-m} $$

and

$$ \sup_n A_{2n} \tilde f_m(x) \geq 1 - 2^{-m} \geq 1/2 $$

outside of a set of measure at most $2^{-m}$. If we then set $f := \sum_m m \tilde f_m$, then $f \in L^1(X)$ (since $\sum_m m 2^{-m} < \infty$), and from the pointwise inequality

$$ \sup_n A_{2n} f(x) \geq m_0 \sup_n A_{2n} \tilde f_m(x) $$

for all $m \geq m_0$ and the Borel-Cantelli lemma we see that $\sup_n A_{2n} f(x)$ is larger than $m_0/2$ for almost every $x$ and any given $m_0$, which yields the claim.
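The two summability facts behind this reduction can be checked with exact arithmetic: the weights $m 2^{-m}$ controlling the $L^1$ norm of a sum of the form $\sum_m m \tilde f_m$ are summable, and the exceptional-set measures $2^{-m}$ have small tails, which is what the Borel-Cantelli lemma requires. The sketch below is illustrative only; the function names are hypothetical.

```python
from fractions import Fraction


def l1_bound(m_max):
    """Partial sum of m * 2^{-m}, an upper bound for the L^1 norm of the truncated sum."""
    return sum(Fraction(m, 2 ** m) for m in range(1, m_max + 1))


def exceptional_tail(m0, m_max):
    """Total measure 2^{-m} of the exceptional sets with m0 <= m <= m_max."""
    return sum(Fraction(1, 2 ** m) for m in range(m0, m_max + 1))


# Partial sums of m * 2^{-m} equal 2 - (M + 2) / 2^M, hence stay below 2.
for M in range(1, 40):
    assert l1_bound(M) == 2 - Fraction(M + 2, 2 ** M)

# The tail starting at m0 is below 2^{1 - m0}, so it tends to zero with m0.
for m0 in range(1, 20):
    assert exceptional_tail(m0, 60) < Fraction(2, 2 ** m0)
```

In particular the total $L^1$ mass is at most $2$, while the weight $m$ in front of each component is unbounded; this mismatch is what makes the maximal function infinite almost everywhere.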


It remains to prove Theorem 2.1. In order to adapt the arguments of Ornstein [10], we would like to interpret the averaging operators $A_n$ as powers $P^n$ of a Markov operator $P$. This is not true as stated, since we do not quite have the semigroup property $A_n A_m = A_{n+m}$ (although $A_n A_m$ does contain a term of the form $\frac{3}{4} A_{n+m}$). However, as observed by Bufetov [3], we can recover a Markov interpretation for $A_n$ by lifting $X$ up to a four-fold cover $\tilde X$ that tracks the “outward normal vector” for the sphere. More precisely, given an $F_2$-system $(X, \mathcal{X}, \mu, (T_g)_{g \in F_2})$, we define the lifted measure space $(\tilde X, \tilde{\mathcal{X}}, \tilde \mu)$ to be the product of $(X, \mathcal{X}, \mu)$ and the four-element space $\{a, b, a^{-1}, b^{-1}\}$ with the uniform probability measure; in particular $\tilde \mu(\tilde X) = \mu(X)$. Let $\pi: \tilde X \to X$ be the projection operator $\pi(x, s) := x$; this induces a pushforward operator $\pi_*: L^1(\tilde X) \to L^1(X)$ and a pullback operator $\pi^*: L^1(X) \to L^1(\tilde X)$ by the formulae

$$ \pi_* \tilde f(x) := \frac{1}{4} \sum_{s \in \{a, b, a^{-1}, b^{-1}\}} \tilde f(x, s) $$

and

$$ \pi^* f(x, s) := f(x) $$

for $f \in L^1(X)$ and $\tilde f \in L^1(\tilde X)$. We also define the Markov operator $P: L^1(\tilde X) \to L^1(\tilde X)$ by

$$ P \tilde f(x, s) := \frac{1}{3} \sum_{s' \in \{a, b, a^{-1}, b^{-1}\}: s' \neq s^{-1}} \tilde f(T_{s'}^{-1} x, s'). $$

One can view $P$ as the Markov operator associated to the Markov chain that for each unit time, moves a given point $(x, s)$ of $\tilde X$ to one of the three points $(T_{s'}^{-1} x, s')$ with $s' \in \{a, b, a^{-1}, b^{-1}\} \setminus \{s^{-1}\}$ chosen at random. By writing the elements of $\{g \in F_2 : |g| = n\}$ as reduced words of length $n$, one can easily verify the identity

$$ A_n f = \pi_* P^n \pi^* f $$

for any $f \in L^1(X)$ and $n \geq 1$. It thus suffices to show

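Theorem 2.2 (Quantitative counterexample, again). Let $\alpha, \varepsilon > 0$. Then there exists an $F_2$-system $(X, \mathcal{X}, \mu, (T_g)_{g \in F_2})$ and a non-negative function $\tilde f \in L^\infty(\tilde X)$ such that

$$ \|\tilde f\|_{L^1(\tilde X)} \leq \alpha \tilde\mu(\tilde X) $$

but such that

$$ \sup_n \pi_* P^{2n} \tilde f(x) \geq 1 - \varepsilon $$

for all $x \in X$ outside of a set of measure at most $\varepsilon \mu(X)$.

Indeed, by setting $f := 4 \pi_* \tilde f$, and noting the pointwise bound $\tilde f \leq \pi^* f$ and the identity $\|f\|_{L^1(X)} = 4 \|\tilde f\|_{L^1(\tilde X)}$, we obtain Theorem 2.1 (after replacing $\alpha$ by $\alpha/4$).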
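The identity expressing the spherical average $A_n$ as pushforward, $n$ Markov steps, and pullback can be tested exactly on a finite toy system in which $T_a, T_b$ are permutations of a finite set with counting measure. The sketch below is an illustration under assumptions: it takes the Markov operator in the form $P\tilde f(x,s) = \frac13 \sum_{s' \neq s^{-1}} \tilde f(T_{s'}^{-1} x, s')$, encodes inverse letters as upper case, and uses hypothetical helper names throughout.

```python
from fractions import Fraction
from itertools import product
import random

LETTERS = "abAB"  # upper case stands for the inverse generator (our convention)
INVERSE = {"a": "A", "A": "a", "b": "B", "B": "b"}


def make_system(size, seed=0):
    """Random permutations T_a, T_b of range(size), together with their inverses."""
    rng = random.Random(seed)
    maps = {}
    for letter in "ab":
        perm = list(range(size))
        rng.shuffle(perm)
        inv = [0] * size
        for i, j in enumerate(perm):
            inv[j] = i
        maps[letter], maps[INVERSE[letter]] = perm, inv
    return maps


def spherical_average(T, f, n):
    """A_n f(x): average of f(T_g^{-1} x) over all reduced words g of length n."""
    total, count = [Fraction(0)] * len(f), 0
    for word in product(LETTERS, repeat=n):
        if any(INVERSE[u] == v for u, v in zip(word, word[1:])):
            continue  # skip non-reduced words
        count += 1
        for x in range(len(f)):
            y = x
            # For g = s_1 ... s_n, T_g^{-1} applies T_{s_1}^{-1} first.
            for s in word:
                y = T[INVERSE[s]][y]
            total[x] += f[y]
    return [t / count for t in total]


def markov_step(T, F):
    """One application of the assumed operator P to F, a dict keyed by (x, s)."""
    return {
        (x, s): sum(F[(T[INVERSE[t]][x], t)] for t in LETTERS if t != INVERSE[s]) / 3
        for (x, s) in F
    }


def lifted_average(T, f, n):
    """Pull f back to the four-fold cover, apply P n times, then push forward."""
    F = {(x, s): Fraction(f[x]) for x in range(len(f)) for s in LETTERS}
    for _ in range(n):
        F = markov_step(T, F)
    return [sum(F[(x, s)] for s in LETTERS) / 4 for x in range(len(f))]
```

For instance, with `T = make_system(6, seed=1)` and any list `f` of six integers, `spherical_average(T, f, n)` and `lifted_average(T, f, n)` agree exactly for small `n`. The extra coordinate `s` records the last letter used, which is what prevents backtracking and turns the non-Markovian spherical averages into iterates of a single operator.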


For inductive reasons, we will prove a technical special case of Theorem 2.2, in which the $F_2$-system is of a certain “good” form, and the sequence $(P^n \tilde f)_{n \geq 0}$ is part of an “ancient Markov chain” $(\tilde f_n)_{n \in \mathbb{Z}}$ that extends to arbitrarily negative times as well as arbitrarily positive times. More precisely, let us define a good system to be an $F_2$-system $(X, \mathcal{X}, \mu, (T_g)_{g \in F_2})$ which admits a decomposition $X = X_a \cup X_b \cup X_0$ into three disjoint sets $X_a, X_b, X_0$ admitting the following (somewhat technical) properties:

(i) (Measure) One has $\mu(X_a) = \mu(X_b) = \frac{1}{4} \mu(X)$ and $\mu(X_0) = \frac{1}{2} \mu(X)$. Furthermore, for any $0 < \kappa < \frac{1}{4} \mu(X)$, one can find a measurable subset of $X_b$ of measure exactly equal to $\kappa$.

(ii) (Invariance) One has $T_a X_a = X_a$ and $T_b X_b = X_b$. Also, one has the inclusions $T_a X_b \subset T_b X_a \cup T_{b^{-1}} X_a \subset X_0$.

(iii) (Ergodicity) One can partition $X_a$ into finitely many $T_a$-invariant components $X_{a,1}, \dots, X_{a,m}$ of positive measure, such that $T_a^2$ is ergodic on each of the components $X_{a,i}$; that is, the only $T_a^2$-invariant measurable subsets of $X_{a,i}$ have measure either $0$ or $\mu(X_{a,i})$.

(iv) (Generation) One has $X = \bigcup_{g \in F_2} T_g X_{a,i}$ up to null sets for each $i = 1, \dots, m$.

Note that relatively few conditions are required on the dynamics on $X_b$; in particular, the ergodicity hypotheses on the system are located in the disjoint region $X_a$. This will allow us to easily modify the dynamics on $X_b$ in order to “glue” two good systems together in Section 4.


See Figure 1. We will construct good systems in subsequent sections. For now, we record one useful property of such systems:


Lemma 2.3 (Pointwise ergodic theorem for good systems). Every good system $(X, \mathcal{X}, \mu, (T_g)_{g \in F_2})$ is $F_2^2$-ergodic. In particular (by Theorem 1.1), for any $f \in L^\infty(X)$, the averages $A_{2n} f$ converge pointwise almost everywhere and in $L^1$ norm to $\frac{1}{\mu(X)} \int_X f\, d\mu$. Furthermore, for any $\tilde f \in L^\infty(\tilde X)$, the functions $P^{2n} \tilde f$ converge pointwise almost everywhere and in $L^1$ norm to $\frac{1}{\tilde\mu(\tilde X)} \int_{\tilde X} \tilde f\, d\tilde\mu$.

Figure 1. A somewhat schematic depiction of a good system. Only part of the action of $T_a$ and $T_b$ are displayed.


Proof. Let $f \in L^\infty(X)$ be an $F_2^2$-invariant function; to establish $F_2^2$-ergodicity, it will suffice to show that $f$ is constant almost everywhere. As $f$ is $T_a^2$-invariant, we see from Axiom (iii) that $f$ is constant almost everywhere on each $X_{a,i}$. Since $F_2 = F_2^2 \cup F_2^2 a$, we see from Axiom (iv), the $T_a$-invariance of $X_{a,i}$, and the $F_2^2$-invariance of $f$ that $f$ is constant almost everywhere on $X$, as required. The final claim does not quite follow from Theorem 1.1, but is immediate from [3, Proposition 1]. □


For any $\alpha > 0$, let $P(\alpha)$ denote the following claim:

Claim 2.4 ($P(\alpha)$). For any $\varepsilon > 0$, there exists a good system $(X, \mathcal{X}, \mu, (T_g)_{g \in F_2})$ with associated decomposition $X = X_a \cup X_b \cup X_0$, and a sequence of non-negative functions $\tilde f_n \in L^\infty(\tilde X)$ for $n \in \mathbb{Z}$ with the following properties:

(v) (Ancient Markov chain) $\tilde f_{n+1} = P \tilde f_n$ for all $n \in \mathbb{Z}$. Equivalently, one has $\tilde f_{n+m} = P^m \tilde f_n$ for all $n \in \mathbb{Z}$ and $m \in \mathbb{N}$. In particular, $\|\tilde f_n\|_{L^1(\tilde X)}$ is independent of $n$.

(vi) (Size) One has $\|\tilde f_n\|_{L^1(\tilde X)} = \alpha \tilde\mu(\tilde X)$ for some $n \in \mathbb{Z}$ (and hence for all $n \in \mathbb{Z}$).

(vii) (Early support) $\tilde f_n$ is supported in $\tilde X_0 := X_0 \times \{a, b, a^{-1}, b^{-1}\}$ for all negative $n$. Furthermore, there exists a finite $A > 0$ such that $\tilde f_n$ is supported in a set of measure at most $A 3^n \tilde\mu(\tilde X)$ for all negative $n$.

(viii) (Large maximum function) We have

$$ \sup_{n \in \mathbb{Z}} \pi_* \tilde f_{2n}(x) \geq 1 - \varepsilon $$

for all $x \in X$ outside of a set of measure at most $\varepsilon \mu(X)$.


Note that our sequence $\tilde f_n$ is ancient in the sense that it extends to arbitrary negative times $n \to -\infty$ as well as to arbitrary positive times $n \to \infty$. This will be essential in order to set up suitable “time delays” in our arguments in later sections. One can informally think of the $\tilde f_n$ as the (normalised) distribution at time $n$ of an ancient Markov process that starts from an infinitely small location deep inside $X_0$ at infinite negative time $n = -\infty$, and only escapes $X_0$ at or after time $n = 0$, and which covers most of $X$ with density roughly 1 or more at some point in time (but crucially, different regions of $X$ may be covered in this fashion at different times).


Observe that if $P(\alpha)$ holds for arbitrarily small values of $\alpha > 0$, and $\varepsilon > 0$ is arbitrary, then from axioms (vii), (viii), one has for any $N$ that

$$ \sup_{n \geq -N} \pi_* \tilde f_{2n}(x) \geq 1 - \varepsilon $$

for all $x \in X$ outside of a set of measure at most $(\varepsilon + O(A 3^{-2N})) \mu(X)$. Taking $N$ large enough (depending on $\varepsilon, A$) and setting $\tilde f := \tilde f_{-2N}$, we obtain Theorem 2.2 (after adjusting $\varepsilon$ as necessary). It thus suffices to show that $P(\alpha)$ holds for arbitrarily small $\alpha > 0$. This will be accomplished using the following two key theorems (the second of which being a variant of [10, Lemma 4]):


Theorem 2.5 (Initial construction). The claim $P(1)$ is true.


Theorem 2.6 (Iteration step). Suppose that $P(\alpha)$ holds for some $0 < \alpha \leq 1$. Then $P(\alpha(1 - \frac{\alpha}{4}))$ is true.


From Theorem 2.5 and Theorem 2.6 we see that the infimum of all $0 < \alpha \leq 1$ for which $P(\alpha)$ holds is zero, and the claim follows. Thus it suffices to establish Theorem 2.5 and Theorem 2.6. This will be accomplished in the next two sections.
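To see why starting from $\alpha = 1$ and repeatedly applying the map $\alpha \mapsto \alpha(1 - \frac{\alpha}{4})$ drives $\alpha$ to zero, note that $\frac{1}{\alpha(1 - \alpha/4)} \geq \frac{1}{\alpha} + \frac{1}{4}$, so after $k$ steps one has $\alpha_k \leq \frac{4}{k+4}$. The following sketch (floating point, with a hypothetical function name) simply iterates the map and checks this bound numerically.

```python
def iterate_alpha(alpha=1.0, steps=1000):
    """Apply alpha -> alpha * (1 - alpha / 4) repeatedly and return every value."""
    values = [alpha]
    for _ in range(steps):
        alpha = alpha * (1 - alpha / 4)
        values.append(alpha)
    return values


values = iterate_alpha()

# The sequence is strictly decreasing and positive ...
assert all(0 < later < earlier for earlier, later in zip(values, values[1:]))

# ... and obeys alpha_k <= 4 / (k + 4), so it tends to zero (slowly).
assert all(v <= 4.0 / (k + 4) + 1e-12 for k, v in enumerate(values))
```

The decay is only of order $1/k$, so many iterations of the gluing step are needed to reach a small $\alpha$, but any positive target is reached after finitely many steps.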


3. The initial construction 


We now prove Theorem 2.5. We will in fact construct an example of a good system $(X, \mathcal{X}, \mu, (T_g)_{g \in F_2})$ and functions $\tilde f_n$ which witness $P(1)$ for all $\varepsilon > 0$ at once.

We begin by constructing an appropriate measure space $(X, \mathcal{X}, \mu)$. For each integer $n$, let $Y_n$ denote the space of half-infinite reduced words $(s_m)_{m \geq n} = s_n s_{n+1} s_{n+2} \dots$, in which each of the $s_i$ are drawn from the alphabet $\{a, b, a^{-1}, b^{-1}\}$, and $a, a^{-1}$ and $b, b^{-1}$ are never adjacent. We give this space the product $\sigma$-algebra (that is, the minimal $\sigma$-algebra for which the coordinate maps $(s_m)_{m \geq n} \mapsto s_m$ are all measurable). By the Kolmogorov extension theorem, we may construct a probability measure $\mu_n$ on $Y_n$ such that each finite reduced subword $s_n \dots s_{n+k}$ for $k \geq 0$ occurs as an initial segment with probability $\frac{1}{4 \times 3^k}$; one can view this measure as the law of the random half-infinite reduced word constructed by choosing $s_n$ uniformly at random from $\{a, b, a^{-1}, b^{-1}\}$, then recursively selecting $s_{n+i+1}$ for $i = 0, 1, 2, \dots$ to be drawn uniformly from $\{a, b, a^{-1}, b^{-1}\} \setminus \{s_{n+i}^{-1}\}$.

The disjoint union $Y := \biguplus_{n \in \mathbb{Z}} Y_n$ of the $Y_n$ admits an action $(S_g)_{g \in F_2}$ of $F_2$, with the action $S_s$ of a generator $s \in \{a, b, a^{-1}, b^{-1}\}$ defined by setting

$$ S_s(s_n s_{n+1} s_{n+2} \dots) := s s_n s_{n+1} s_{n+2} \dots \in Y_{n-1} $$

for $s_n s_{n+1} s_{n+2} \dots \in Y_n$ and $s \in \{a, b, a^{-1}, b^{-1}\} \setminus \{s_n^{-1}\}$, and

$$ S_s(s_n s_{n+1} s_{n+2} \dots) := s_{n+1} s_{n+2} \dots \in Y_{n+1} $$

for $s_n s_{n+1} s_{n+2} \dots \in Y_n$ and $s = s_n^{-1}$; thus $S_g$ is the operation of formal left-multiplication by $g$, after reducing any non-reduced words. If we give $Y$ the measure $\mu_Y := \sum_{n \in \mathbb{Z}} 3^{-n} \mu_n$, then one can easily verify that this action is measure-preserving. Unfortunately, $\mu_Y$ is an infinite measure due to the contribution of the negative $n$, and so this space is not quite suitable for our needs. Instead, we shall work with a certain subquotient of $Y$, defined as follows.
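The action of a generator on half-infinite reduced words, and the reason a weight of $3^{-n}$ on the level $Y_n$ makes it measure-preserving, can be illustrated on finite prefixes (cylinder sets). In the sketch below a cylinder in $Y_n$ is stored as a pair `(n, prefix)`; this representation, the upper-case encoding of inverses, and the function names are assumptions made for illustration. A cylinder given by a prefix of length $k+1$ in $Y_n$ is taken to have weight $3^{-n} \cdot \frac{1}{4 \times 3^{k}}$.

```python
from fractions import Fraction

INVERSE = {"a": "A", "A": "a", "b": "B", "B": "b"}  # upper case = inverse letter


def shift(s, point):
    """Apply the generator shift S_s to a cylinder (n, prefix) of Y_n."""
    n, prefix = point
    if prefix and prefix[0] == INVERSE[s]:
        # s cancels the first letter: the word gets shorter and moves to Y_{n+1}.
        return (n + 1, prefix[1:])
    # Otherwise s is prepended and the word moves to Y_{n-1}.
    return (n - 1, s + prefix)


def cylinder_weight(point):
    """Weight 3^{-n} / (4 * 3^k) of a cylinder with a prefix of length k + 1 in Y_n."""
    n, prefix = point
    return Fraction(3) ** (-n) * Fraction(1, 4 * 3 ** (len(prefix) - 1))


point = (2, "abA")  # a cylinder in Y_2 with reduced prefix a b a^{-1}
for s in "abAB":
    image = shift(s, point)
    # The weight is unchanged, and the inverse generator undoes the shift.
    assert cylinder_weight(image) == cylinder_weight(point)
    assert shift(INVERSE[s], image) == point
```

Prepending a letter lengthens the prefix (dividing the word probability by 3) while lowering the level by one (multiplying the weight by 3), and cancelling does the reverse, so the two effects balance. The check is restricted to prefixes of length at least two so that a cancellation still leaves a non-empty prefix.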

Firstly, we restrict $Y$ to the space $\biguplus_{n \geq 0} Y_n$, which can be thought of as a suitably rescaled limit of an “infinitely large ball” in $F_2$, with $Y_0$ being the “boundary” of this ball, and the $Y_n$ lying increasingly deeper in the “interior” of the ball as $n$ increases (see Figure 2). This makes the shift maps $S_s$, $s \in \{a, b, a^{-1}, b^{-1}\}$ partially undefined on the $Y_0$ boundary, but we will fix this later by redefining these maps on (a quotient) of $Y_0$. Next, we introduce a reflection operation $x \mapsto \bar{x}$ on the boundary $Y_0$ by mapping

$$ \overline{s_0 s_1 s_2 \dots} := s_0^{-1} s_1^{-1} s_2^{-1} \dots. $$

It is clear that this map preserves the measure $\mu_0$. If we then form the quotient space $Y_0/\!\sim\ := \{\{x, \bar{x}\} : x \in Y_0\}$, we can obtain a probability measure $\mu_0/\!\sim$ on $Y_0/\!\sim$ by pushing forward the probability measure $\mu_0$ under the quotient map. We observe that $Y_0/\!\sim$ splits into two components of equal measure $1/2$, namely $((S_a Y_1 \cap Y_0) \cup (S_a^{-1} Y_1 \cap Y_0))/\!\sim$ and $((S_b Y_1 \cap Y_0) \cup (S_b^{-1} Y_1 \cap Y_0))/\!\sim$, noting that the sets $S_a Y_1 \cap Y_0$, $S_a^{-1} Y_1 \cap Y_0$ are disjoint reflections of each other, and similarly for $S_b Y_1 \cap Y_0$, $S_b^{-1} Y_1 \cap Y_0$.

We then define $X$ to be the quotient space $X := \biguplus_{n \geq 1} Y_n \uplus (Y_0/\!\sim)$ with measure $\mu := \sum_{n \geq 1} 3^{-n} \mu_n + \frac{1}{2} \mu_0/\!\sim$, thus

$$ \mu(X) = \sum_{n \geq 1} 3^{-n} + \frac{1}{2} = 1. $$

We set $X_0 := \biguplus_{n \geq 1} Y_n$, $X_a := ((S_b Y_1 \cap Y_0) \cup (S_b^{-1} Y_1 \cap Y_0))/\!\sim$, and $X_b := ((S_a Y_1 \cap Y_0) \cup (S_a^{-1} Y_1 \cap Y_0))/\!\sim$. Thus

$$ \mu(X_0) = \sum_{n \geq 1} 3^{-n} = \frac{1}{2} $$

and $\mu(X_a) = \mu(X_b) = \frac{1}{4}$. One can think of $X_0$ as the “interior” of $X$, with $X_a$ and $X_b$ being two equally sized pieces of the “boundary” of $X_0$. Also, $X_a, X_b$ are Cantor spaces (and $\mu$ is a Cantor measure on such spaces), and so one can easily construct measurable subsets of $X_b$ of arbitrary measure between $0$ and $\mu(X_b)$. Thus Axiom (i) is satisfied.

Figure 2. A fragment of the infinite measure space $Y$. The centre disk represents a portion of $Y_1$ consisting of reduced words $s_1 s_2 \dots$ with initial letter $s_1 = a$. The remaining disks are images of this disk under shifts by various elements of $F_2$, and all have equal measure with respect to $\mu_Y$. This image should be compared with the infinite tree that is the Cayley graph of $F_2$.

Also, one can easily create a measure-preserving invertible map $T_a^0: X_a \to X_a$ such that $(T_a^0)^2$ is ergodic on $X_a$; this can be done for instance by identifying $X_a$ (which, after normalisation, is an atomless standard probability space) as a measure space (up to null sets) with the unit circle with Haar measure, and then setting $T_a^0$ to be an irrational translation map.





We now define the shifts $T_a: X \to X$ and $T_b: X \to X$ as follows.

(1) If $x \in X_0$, then $T_a x$ is defined to be $S_a x$ projected onto $X$, and $T_b x$ is similarly defined to be $S_b x$ projected onto $X$. (The projection is only necessary of course if $S_a x$ or $S_b x$ lands in $Y_0$.)

(2) If $x \in X_a$, then $T_a x := T_a^0 x$. If instead $x \in X_b$, $T_a x$ is defined to be $S_a x' \in Y_1$, where $x' \in S_a^{-1} Y_1 \cap Y_0$ is the lift of $x$ to $S_a^{-1} Y_1 \cap Y_0$.

(3) If $x \in X_b$, then $T_b x := x$. If instead $x \in X_a$, $T_b x$ is defined to be $S_b x' \in Y_1$, where $x' \in S_b^{-1} Y_1 \cap Y_0$ is the lift of $x$ to $S_b^{-1} Y_1 \cap Y_0$.

One then defines $T_g$ for the remaining $g \in F_2$ in the usual fashion. In particular, one sees that for any $x$ in the interior $X_0$ and any $s \in \{a, b, a^{-1}, b^{-1}\}$, $T_s x$ is equal to $S_s x$ projected onto $X$. Informally, the shifts $T_s: X \to X$ for $s \in \{a, b, a^{-1}, b^{-1}\}$ are inherited from the shifts $S_s: Y \to Y$ except for the boundary actions of $T_a, T_a^{-1}$ on $X_a$ and of $T_b, T_b^{-1}$ on $X_b$, which are given by $T_a^0$ (and its inverse) and the identity map respectively. (There is nothing special about the identity map here; an arbitrary measure-preserving map on $X_b$ could be substituted here for our purposes.)

Proposition 3.1. $(X, \mathcal{X}, \mu, (T_g)_{g \in F_2})$ is a good system.


Proof. It is a routine matter to verify that $T_a, T_b$ are invertible and measure-preserving, so that $(X, \mathcal{X}, \mu, (T_g)_{g \in F_2})$ is an $F_2$-system. Axiom (i) was already verified. For Axiom (ii), we note that $T_a X_b \subset Y_1 \subset (S_b Y_0 \cap Y_1) \cup (S_b^{-1} Y_0 \cap Y_1) = T_b X_a \cup T_b^{-1} X_a$, as required. We set $m = 1$ and $X_{a,1} := X_a$, then Axiom (iii) is true from construction, and Axiom (iv) is also easily verified. □


It remains to construct a sequence $\tilde f_n$ of non-negative functions in $L^\infty(\tilde X)$ for each $n \in \mathbb{Z}$ obeying Axioms (v)-(viii) with $\alpha = 1$. For negative $n$, we define $\tilde f_n$ by setting

$$ \tilde f_n(x, s) := 4 \times 3^{-n-1} $$

whenever $x \in X$ and $s \in \{a, b, a^{-1}, b^{-1}\}$ are such that $x \in Y_{-n}$ and $S_s x \in Y_{-n-1}$, and $\tilde f_n(x, s) = 0$ otherwise. These are clearly non-negative functions in $L^\infty(\tilde X)$ obeying Axiom (vii). It is routine to verify that $\tilde f_{n+1} = P \tilde f_n$ for all $n \leq -2$. If we then define $\tilde f_n$ for non-negative $n$ by the formula

$$ \tilde f_n := P^{n+1} \tilde f_{-1}, $$

then we have Axiom (v). For negative $n$ we have

$$ \|\tilde f_n\|_{L^1(\tilde X)} = 4 \times 3^{-n-1} \times \frac{3}{4} \times 3^{n} = 1, $$

which gives Axiom (vi) (using Axiom (v) to extend to non-negative $n$). Finally, from Lemma 2.3 we see that $\tilde f_n$ converges pointwise almost everywhere to $1$ as $n \to +\infty$, and so Axiom (viii) follows from Egorov's theorem. This concludes the proof of Theorem 2.5.


4. The iteration step


We now prove Theorem 2.6. Let $0 < \alpha \leq 1$ be such that $P(\alpha)$ holds. By Claim 2.4 (with $\varepsilon$ replaced by $\varepsilon/4$), and normalising $X$ to have measure 1, we may find a good system $(X, \mathcal{X}, \mu, (T_g)_{g \in F_2})$ with associated decomposition $X = X_a \cup X_b \cup X_0$ and measure $\mu(X) = 1$, and a sequence of non-negative functions $\tilde f_n \in L^\infty(\tilde X)$ for $n \in \mathbb{Z}$ with the following properties:

(v) (Ancient Markov chain) $\tilde f_{n+1} = P \tilde f_n$ for all $n \in \mathbb{Z}$.

(vi) (Size) One has $\|\tilde f_n\|_{L^1(\tilde X)} = \alpha$ for all $n \in \mathbb{Z}$.

(vii) (Early support) $\tilde f_n$ is supported in $X_0 \times \{a, b, a^{-1}, b^{-1}\}$ for all negative $n$. Furthermore, there exists a finite $A > 0$ such that $\tilde f_n$ is supported in a set of measure at most $A 3^n$ for all negative $n$.

(viii) (Large maximum function) We have

$$ \sup_{n \in \mathbb{Z}} \pi_* \tilde f_{2n}(x) \geq 1 - \varepsilon/4 $$

for all $x \in X$ outside of a set of measure at most $\varepsilon/4$.


It will suffice to construct a good system $(X', \mathcal{X}', \mu', (T'_g)_{g \in F_2})$ with associated decomposition $X' = X'_a \cup X'_b \cup X'_0$, Markov operator $P'$, and measure $\mu'(X') = 2$, and a sequence of non-negative functions $\tilde f'_n \in L^\infty(\tilde X')$ for $n \in \mathbb{Z}$ with the following properties:

(v’) (Ancient Markov chain) $\tilde f'_{n+1} = P' \tilde f'_n$ for all $n \in \mathbb{Z}$.

(vi’) (Size) One has $\|\tilde f'_n\|_{L^1(\tilde X')} = \alpha(2 - \frac{\alpha}{2})$ for all $n \in \mathbb{Z}$.

(vii’) (Early support) $\tilde f'_n$ is supported in $X'_0 \times \{a, b, a^{-1}, b^{-1}\}$ for all negative $n$. Furthermore, there exists a finite $A' > 0$ such that $\tilde f'_n$ is supported in a set of measure at most $2 A' 3^n$ for all negative $n$.

(viii’) (Large maximum function) We have

$$ \sup_{n \in \mathbb{Z}} \pi_* \tilde f'_{2n}(x') \geq 1 - \varepsilon $$

for all $x' \in X'$ outside of a set of measure at most $2\varepsilon$.


We construct this system as follows. First, from Axiom (viii) and Egorov's theorem, we may find a natural number $N$ such that

$$ \sup_{-N \leq n \leq N} \pi_* \tilde f_{2n}(x) \geq 1 - \varepsilon/3 \qquad (4.1) $$

for all $x \in X$ outside of a set of measure at most $\varepsilon/3$. We let $0 < \kappa < 1/4$ be a small quantity depending on $\varepsilon$, $N$ and the $\tilde f_n$ to be chosen later. We will construct the good system $(X', \mathcal{X}', \mu', (T'_g)_{g \in F_2})$ to be two copies of $(X, \mathcal{X}, \mu, (T_g)_{g \in F_2})$ glued together by a small amount of coupling, with the $\kappa$ parameter measuring the amount of coupling. More precisely, we define the measure space $(X', \mathcal{X}', \mu')$ to be the product of $(X, \mathcal{X}, \mu)$ with the two-element set $\{1, 2\}$ with counting measure. Next, using Axiom (i), we can find a subset $E$ of $X_b$ of measure exactly $\kappa$. We now define the shift maps $T'_a, T'_b: X' \to X'$ as follows. The map $T'_a$ is a trivial lift of $T_a$, thus

$$ T'_a(x, i) := (T_a x, i) $$

for $x \in X$ and $i \in \{1, 2\}$. The map $T'_b$ is an almost trivial lift of $T_b$. Namely, we define

$$ T'_b(x, i) := (T_b x, i) $$

for $x \in X \setminus E$ and $i \in \{1, 2\}$, but define

$$ T'_b(x, i) := (T_b x, 3 - i) $$

for $x \in E$ and $i \in \{1, 2\}$; see Figure 3. Finally, we partition $X' = X'_a \cup X'_b \cup X'_0$ where $X'_a := X_a \times \{1, 2\}$, $X'_b := X_b \times \{1, 2\}$, $X'_0 := X_0 \times \{1, 2\}$. We then define $T'_g$ for the remaining $g \in F_2$ in the usual fashion.
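The gluing of two copies of a system through a small coupling set can be mimicked on a finite toy model, where a permutation of a finite set stands in for the map $T_b$ and a small subset plays the role of $E$. The sketch below is purely illustrative: the finite model and the function names are assumptions, and it only checks that the lifted map is again a bijection of the doubled space which swaps the two sheets exactly on the coupling set.

```python
def glue(perm_b, coupling_set):
    """Lift a permutation (as a list) to X x {1, 2}, swapping sheets on coupling_set."""
    lifted = {}
    for x, image in enumerate(perm_b):
        for sheet in (1, 2):
            # On the coupling set the sheet i is sent to 3 - i; elsewhere it is kept.
            target_sheet = 3 - sheet if x in coupling_set else sheet
            lifted[(x, sheet)] = (image, target_sheet)
    return lifted


def is_bijection(mapping):
    """Check that a self-map of a finite set, stored as a dict, is a bijection."""
    return set(mapping.values()) == set(mapping.keys())


perm_b = [1, 2, 0, 3]   # stand-in for T_b on a four-point space
E = {3}                 # stand-in for the small coupling set
lifted_b = glue(perm_b, E)

assert is_bijection(lifted_b)
assert lifted_b[(3, 1)] == (3, 2)   # points of E change sheet
assert lifted_b[(0, 1)] == (1, 1)   # all other points stay on their sheet
```

Because the lifted map is a bijection that preserves counting measure, it is measure-preserving in the toy model; the only communication between the two sheets happens through the coupling set, which is the mechanism that lets the two copies interact as weakly as desired.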

Proposition 4.1 (Good system). If $\kappa$ is sufficiently small, then $(X', \mathcal{X}', \mu', (T'_g)_{g \in F_2})$ is a good system with $\mu'(X') = 2$.


Proof. Axioms (i) and (ii) are easily verified, so we focus on verifying Axioms (iii) and (iv).

By Axiom (iii) for $X$, $X_a$ is partitioned into finitely many $T_a$-invariant components $X_{a,1}, \dots, X_{a,m}$ of positive measure, each of which is $T_a^2$-ergodic. This induces a partition of $X'_a$ into the $2m$ components $X_{a,1} \times \{1\}, \dots, X_{a,m} \times \{1\}, X_{a,1} \times \{2\}, \dots, X_{a,m} \times \{2\}$, and each of these components is clearly $(T'_a)^2$-ergodic.

Now we verify Axiom (iv). We need to show that $X' = \bigcup_{g \in F_2} T'_g(X_{a,i} \times \{j\})$ up to null sets for each $i = 1, \dots, m$ and $j = 1, 2$. Denote the right-hand side by $Y$, thus $Y$ is $F_2$-invariant and contains $X_{a,i} \times \{j\}$. On the other hand, by Axiom (iv) for $X$ and the pigeonhole principle, there exists $g \in F_2$ such that $T_g X_{a,i}$ intersects $E$ in a set of positive measure. We may assume that the word length $|g|$ of $g$ is minimal among all $g$ with this property, thus $T_h X_{a,i} \cap E$ is null whenever $|h| < |g|$. From this we see that $T'_g(X_{a,i} \times \{j\})$ intersects $E \times \{j\}$ in a set of positive measure (since the dynamics of $T'$ are just a trivial lift of the dynamics of $T$ outside of $E \times \{1, 2\}$). From construction of $T'_b$, this implies that $T'_{bg}(X_{a,i} \times \{j\})$ intersects $T_b E \times \{3 - j\} \subset X_b \times \{3 - j\}$ in a set of positive measure, and hence by Axiom (ii) the union of $T'_{babg}(X_{a,i} \times \{j\})$ and $T'_{b^{-1}abg}(X_{a,i} \times \{j\})$ intersects $X_a \times \{3 - j\}$ in a set of positive measure; in particular, $Y$ intersects $X_a \times \{3 - j\}$ in a set of positive measure. As $Y$ is $(T'_a)^2$-invariant, we conclude from Axiom (iii) that $Y$ contains $X_{a,i'} \times \{3 - j\}$ up to null sets for some $i' = 1, \dots, m$.

Next, by another appeal to Axiom (iv) and the pigeonhole principle, we can find $g_{i,i'} \in F_2$ such that $T_{g_{i,i'}} X_{a,i'}$ and $X_{a,i}$ intersect in a set of positive measure. Note that as there are only $m$ choices for $i'$, the word length of $g_{i,i'}$ can be bounded above, and the measure of $T_{g_{i,i'}} X_{a,i'} \cap X_{a,i}$ bounded below, by quantities independent of $\kappa$. Because of this, we see that if $\kappa$ (and hence $E$) is small enough, then $T'_{g_{i,i'}}(X_{a,i'} \times \{3 - j\})$ and $X_{a,i} \times \{3 - j\}$ also intersect in a set of positive measure; thus $Y$ must intersect $X_{a,i} \times \{3 - j\}$ in a set of positive measure, and hence by the $T_a^2$-ergodicity of $X_{a,i}$, $Y$ contains $X_{a,i} \times \{3 - j\}$ up to null sets. Since $Y$ already contained $X_{a,i} \times \{j\}$, we thus have $X_{a,i} \times \{1, 2\}$ contained in $Y$ up to null sets.

Figure 3. The good system $(X', \mathcal{X}', \mu', (T'_g)_{g \in F_2})$, which is formed by gluing together two barely interacting copies of $(X, \mathcal{X}, \mu, (T_g)_{g \in F_2})$.

Now for almost every $(x, j') \in X'$, we have from Axiom (iv) that $x = T_g y$ for some $y \in X_{a,i}$ and $g \in F_2$. This implies that $(x, j') = T'_g(y, j'')$ for some $j'' \in \{1, 2\}$, and hence $(x, j') \in Y$ for almost every $(x, j') \in X'$, which gives Axiom (iv) for $X'$ as required. □

We let $M$ be a large natural number, depending on all previous quantities (in particular, depending on $\kappa$), to be chosen later. The functions $\tilde f'_n \in L^\infty(\tilde X')$ will be defined for negative $n$ by the formulae

$$ \tilde f'_n(x, 1, s) := \tilde f_n(x, s) $$

and

$$ \tilde f'_n(x, 2, s) := \left(1 - \frac{\alpha}{2}\right) \tilde f_{n-2M}(x, s) $$

for any $x \in X$ and $s \in \{a, b, a^{-1}, b^{-1}\}$. Informally, $\tilde f'_n$ is two copies of $\tilde f_n$, one over $X \times \{1\}$ and one over $X \times \{2\}$, with the latter experiencing a significant time delay and also a slight reduction in amplitude; the point is that we can delay the $X \times \{2\}$ dynamics until the dynamics of $X \times \{1\}$ has mixed almost completely, so that half of the mass of the $X \times \{1\}$ component is spread out almost uniformly over $X \times \{2\}$, allowing for the crucial amplitude reduction for the $X \times \{2\}$ component. The idea behind this construction is due to Ornstein [10, Lemma 4].

Clearly, Axiom (vii’) is a consequence of Axiom (vii) (we allow the constant $A'$ to depend on $M$). For functions supported on $X'_0 \times \{a, b, a^{-1}, b^{-1}\}$, the Markov operator $P'$ is a trivial lift of the Markov operator $P$, so (from Axiom (vii’)) one sees that $\tilde f'_{n+1} = P' \tilde f'_n$ for all $n \leq -2$. We now define $\tilde f'_n$ for non-negative $n$ by setting

$$ \tilde f'_n := (P')^{n+1} \tilde f'_{-1} $$

so that Axiom (v’) holds. Clearly the $\tilde f'_n$ are non-negative and in $L^\infty$, and direct calculation shows that Axiom (vi’) holds for all negative $n$, and hence for all $n$ thanks to Axiom (v’).

The only remaining task is to show Axiom (viii’). By the union bound, it suffices to show the bounds on $X \times \{1\}$ and $X \times \{2\}$ separately. More precisely, we establish the following two propositions.

Proposition 4.2. If $\kappa$ is sufficiently small (depending on $\varepsilon$, $N$, and the $\tilde f_n$, but without any dependence on $M$), we have

$$ \sup_{n \in \mathbb{Z}} \pi_* \tilde f'_{2n}(x, 1) \geq 1 - \varepsilon $$

for all $x \in X$ outside of a set of measure at most $\varepsilon$.


Proof. By construction, we have

$$ \tilde f'_n(x, 1, s) = \tilde f_n(x, s) $$

for negative $n$, all $x \in X$, and $s \in \{a, b, a^{-1}, b^{-1}\}$. Now we turn to non-negative $n$. Note that as $P$ is a contraction on $L^\infty$, the $\tilde f_n$ for non-negative $n$ are uniformly bounded in $L^\infty$ by some quantity $B$ independent of $\kappa$. A routine induction then shows that

$$ \int_{\tilde X} \max(\tilde f_n(x, s) - \tilde f'_n(x, 1, s), 0)\, d\tilde\mu(x, s) \leq C_{B,n} \kappa $$

for all non-negative $n$ and some quantity $C_{B,n}$ that depends on $B, n$ but not on $\kappa$; this is basically because on $X \times \{1\} \times \{a, b, a^{-1}, b^{-1}\}$, the Markov process associated to $P'$ only differs from that associated to $P$ on the set $E \times \{1\} \times \{b\} \cup T_b E \times \{1\} \times \{b^{-1}\}$, which has measure $\kappa/2$. Applying $\pi_*$ and then the triangle inequality, we conclude that

$$ \int_X \max\left( \sup_{-N \leq n \leq N} \pi_* \tilde f_{2n}(x) - \sup_{-N \leq n \leq N} \pi_* \tilde f'_{2n}(x, 1), 0 \right) d\mu(x) \leq C'_{B,N} \kappa $$

for some $C'_{B,N}$ independent of $\kappa$; in particular, from Markov's inequality we see (for $\kappa$ small enough) that

$$ \sup_{-N \leq n \leq N} \pi_* \tilde f_{2n}(x) - \sup_{-N \leq n \leq N} \pi_* \tilde f'_{2n}(x, 1) \leq \varepsilon/3 $$

for all $x \in X$ outside of a set of measure at most $\varepsilon/3$. Combining this with (4.1), we obtain the claim. □


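Proposition 4.3. If $\kappa$ is sufficiently small (depending on $\varepsilon$, $N$, and the $\tilde f_n$, but without any dependence on $M$), we have

$$ \sup_{n \in \mathbb{Z}} \pi_* \tilde f'_{2n}(x, 2) \geq 1 - \varepsilon $$

for all $x \in X$ outside of a set of measure at most $\varepsilon$.


Proof. We split

$$ \tilde f'_n = \tilde f'_{n,1} + \tilde f'_{n,2} $$

where for negative $n$, $\tilde f'_{n,i}$ is the restriction of $\tilde f'_n$ to $X \times \{i\} \times \{a, b, a^{-1}, b^{-1}\}$, and for non-negative $n$, $\tilde f'_{n,i}$ is propagated by $P'$:

$$ \tilde f'_{n,i} := (P')^{n+1} \tilde f'_{-1,i}. $$

Observe that the $\tilde f'_{n,1}$ component of $\tilde f'_n$ does not depend on $M$.

From Lemma 2.3 we see that $\tilde f'_{n,1}$ converges pointwise almost everywhere as $n \to \infty$ to the constant

$$ \frac{1}{\tilde\mu'(\tilde X')} \int_{\tilde X'} \tilde f'_{-1,1}\, d\tilde\mu' = \frac{1}{2} \int_{\tilde X} \tilde f_{-1}\, d\tilde\mu = \frac{\alpha}{2}. $$

In particular, $\pi_* \tilde f'_{n,1}$ converges pointwise almost everywhere to the same constant. Thus, by Egorov's theorem, and assuming $M$ sufficiently large (depending on previous quantities such as $\varepsilon$, $\kappa$, and the $\tilde f_n$, but without any circular dependency of $M$ on itself) we have

$$ \inf_{n \geq 2M - 2N} \pi_* \tilde f'_{n,1}(x, 2) \geq \frac{\alpha}{2} - \frac{\varepsilon}{3} \qquad (4.2) $$

for all $x \in X$ outside of a set of measure at most $\varepsilon/3$.

Now we work on $\tilde f'_{n,2}$. For $n < 2M$, an induction (using Axiom (vii)) shows that $\tilde f'_{n,2}$ is supported on $X_0 \times \{2\} \times \{a, b, a^{-1}, b^{-1}\}$, and that

$$ \tilde f'_{n,2}(x, 2, s) = \left(1 - \frac{\alpha}{2}\right) \tilde f_{n-2M}(x, s) $$

for all $x \in X$ and $s \in \{a, b, a^{-1}, b^{-1}\}$. Repeating the arguments used to prove Proposition 4.2, we see (if $\kappa$ is sufficiently small depending on $\varepsilon, N$, but (crucially) without any dependence on $M$) that

$$ \left(1 - \frac{\alpha}{2}\right) \sup_{M-N \leq n \leq M+N} \pi_* \tilde f_{2n-2M}(x) - \sup_{M-N \leq n \leq M+N} \pi_* \tilde f'_{2n,2}(x, 2) \leq \varepsilon/3 $$

for all $x \in X$ outside of a set of measure at most $\varepsilon/3$. Combining this with (4.2), we see that

$$ \sup_n \pi_* \tilde f'_{2n}(x, 2) \geq \frac{\alpha}{2} + \left(1 - \frac{\alpha}{2}\right) \sup_{-N \leq n \leq N} \pi_* \tilde f_{2n}(x) - \frac{2\varepsilon}{3} $$

for all $x \in X$ outside of a set of measure at most $2\varepsilon/3$. Applying (4.1), we then obtain the claim. □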


The proof of Theorem 2.6, and thus Theorem 1.2, is now complete.


References 

[1] L. Bowen, A. Nevo, Pointwise ergodic theorems beyond amenable groups, Ergodic Theory Dynam. Systems 33 (2013), no. 3, 777-820.

[2] L. Bowen, A. Nevo, Amenable equivalence relations and the construction of ergodic averages for 
group actions, preprint. 

[3] A. Bufetov, Convergence of spherical averages for actions of free groups, Ann. of Math. (2) 155 
(2002), no. 3, 929-944. 

[4] K. Fujiwara, A. Nevo, Maximal and pointwise ergodic theorems for word-hyperbolic groups, Ergodic 
Theory Dynam. Systems 18 (1998), no. 4, 843-858. 

[5] E. Lindenstrauss, Pointwise theorems for amenable groups, Invent. Math. 146 (2001), no. 2, 259- 
295. 

[6] G. A. Margulis, A. Nevo, E. Stein, Analogs of Wiener’s ergodic theorems for semisimple Lie groups. 
II, Duke Math. J. 103 (2000), no. 2, 233-259. 

[7] A. Naor, T. Tao, Random martingales and localization of maximal inequalities, J. Funct. Anal. 259 (2010), no. 3, 731-779.

[8] A. Nevo, E. Stein, A generalization of Birkhoff’s pointwise ergodic theorem, Acta Math. 173 
(1994), no. 1, 135-154. 

[9] A. Nevo, E. Stein, Analogs of Wiener’s ergodic theorems for semisimple groups. I, Ann. of Math. (2) 145 (1997), no. 3, 565-595.

[10] D. Ornstein, On the pointwise behavior of iterates of a self-adjoint operator, J. Math. Mech. 18 (1968/1969), 473-477.

[11] G.-C. Rota, An “Alternierende Verfahren” for general positive operators. Bull. Amer. Math. Soc. 
68 (1962), 95-102. 

[12] E. M. Stein, On the maximal ergodic theorem, Proc. Nat. Acad. Sci. U.S.A. 47 (1961) 1894-1897. 


UCLA Department of Mathematics, Los Angeles, CA 90095-1555. 
E-mail address: tao@math.ucla.edu 







